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Abstract. We are interested in the spread of an epidemic between two communities that have 
higher connectivity within than between them. We model the two communities as independent 
Erdos-Renyi random graphs, each with n vertices and edge probabihty p — n"^^ (0 < a < 1), 
then add a small set of bridge edges, B, between the communities. We model the epidemic 
on this network as a contact process (Susceptible-Infected-Susceptible infection) with infection 
rate A and recovery rate 1. If npX = b > 1 then the contact process on the Erdos-Renyi 
random graph is supercritical, and we show that it survives for exponentially long. Further, 
let T be the time to infect a positive fraction of vertices in the second community when the 
infection starts from a single vertex in the first community. We show that on the event that 
the contact process survives exponentially long, T\B\/(np) converges in distribution to an 
exponential random variable with a specified rate. These results generalize to a graph with N 
communities. 

1. Introduction 

Let Gi = {Vi,Ei) and G2 = {V2,E2) be independent instances of Q{n,p), the Erdos-Renyi 
random graph ensemble with n vertices and edge probabihty p. Construct the graph G = {V, E) 
such that y = li U V2 and E = EiU E2U B, where B C Fi x ^2 is a set of 'bridge' edges 
chosen independently of Gi and G2. When i? is a small set of edges relative to Ei (i = 1,2), 
then the graph will have two distinct communities with a higher concentration of edges within 
each community than between the two communities. During the 2009-2010 Stochastic Analysis 
program at SAMSI, John McSweeney and Bruce Rogers simulated the contact process (defined 
carefully below) on this graph as a model for a Susceptible-Infected-Susceptible epidemic in a 
network with two communities. Figure 1 depicts their results on a network with 1000 total 
vertices (n = 500 in each community), mean degree np = 50 and |S| = 2 bridge edges. Each 
line represents an independent simulation in which initially there are 2 infected vertices in Vi 
and all other vertices are healthy; infected vertices become healthy at rate 1 and transmit the 
infection to their neighbors at rate A = 0.06. In each simulation the infection very quickly 
reaches a quasi-equilibrium state in the first component Vi, then makes a jump to the second 
component V2 at a random time. We were motivated by the picture to prove this rigorously, 
and to determine the limiting distribution of the random jump time. We also prove that the 
contact process survives for exponentially long (in n) on the random graph before eventually 
hitting the absorbing state in which all vertices are healthy. 

The contact process on a graph G is a continuous time Markov process C V, where 
denotes the set of infected vertices at time t. If A = X{n,p) > is the infection rate, then 
an infected vertex sends the infection to each of its neighbors in G according to independent 
Poisson processes with rate A, and it becomes uninfected according to an independent Poisson 
process with rate 1. In effect, a healthy vertex v ^ becomes infected at rate \ \J\f{v) n^t|, 




"I 1 1 1 1 r 

20 40 60 80 10 

Time 



Figure 1. Simulation results for the contact process on G with \B\ = 2, n = 
500, p = 0.1 and A = 0.06 so that 6 = 3. Each line represents an independent 
trial started from two initially infected vertices in Vi. Picture is due to John 
McSweeney and Bruce Rogers. 

where J\f{v) denotes the set of neighbors of v in G, while an infected vertex becomes healthy at 
rate 1. This is made rigorous by Harris' graphical construction, which is described in Section 4. 

The contact process has been recently studied on two different models of power-law random 
graphs by Chatterjee and Durrett [3] and Berger, Borgs, Chayes and Saberi [1]. The term 
power-law random graph refers to the degree distribution having tails that decay like ck~°'. For 
the random graphs considered in [3] and [1] it was shown that the contact process survives on 
these graphs for a long time for any A > 0. This was in contrast to the mean- field calculations 
of [9] and [10], which predicted that for a > 3 there is a Ac > so the contact process will die 
out quickly for A < Ac. Subsequently, Peterson showed that Ac > for the contact process on 
the complete graph with random vertex weights following a power-law [11], and in fact explicitly 
gives the value of Ac in terms of the second moment of the vertex weights. 

1.1. Main Results. We study the high-degree regime where the mean degree of the random 
graph scales as np = with a £ (0, 1]. Since we want the graph to be connected, we need 
the average degree to be at least np > clogn for c > 1. However, part of the proof requires 
that the random walk on Q{n,p) be very close to uniformly distributed on the vertices after 
a short amount of time (see Lemmas 2.5 and 4.1). This part of the proof is simplified when 
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np = n", though a similar method may work for smaller average degrees. Additionally, in the 
early stages of the infection process we must have large neighborhoods so that transmission 
events between neighboring infected vertices are rare. For smaller degrees, additional care will 
be needed to guarantee that the infection spreads out quickly. 

Before wc state our main results, we will need some notation. Let P [•] denote the law of 
G = GiU G2U B, where Gi, G2 ^ G{n,p) are independent random graphs, and the edges in B 
are chosen independently of Gi and G2 such that each edge in B has exactly one endpoint in 
Gi and one in G2. The dependence on n and p has been suppressed from our notation. Once G 
is chosen according P [•], it is fixed for all time, so we let (•) be the law of the contact process 
conditional on G with = A C V, and abbreviate Fy (•) := P^^j. (•). We will also often want 
to observe the contact process on only one of the two subcomponents. That is, we will ignore 
the edges in B, so = H Vi for all t whenever ^0 = ^ C Vi, and we denote the law of this 
restricted process conditional on G by P^^ (•). We will also use a„ = 0{hn) to mean an < Ghn 
for C > 0, On = ^{hn) to mean an > cbn for c > 0, and an = o{bn) to mean a-n/bn — >■ 0. 

First we must guarantee that the contact process on the random graph can survive long 
enough to spread from the first community to the second. If npX = b < 1, then the contact 
process is dominated by a subcritical branching process, and dies out quickly, so we assume 
that npX = b > 1. In this case we have the following theorem, which says that the supercritical 
contact process survives exponentially long on the random graph when it survives for at least 
ri(loglogn) time. We assume survival for at least r2(loglogn) time because at this point the 
infection has either died out (with probability 1/6) or reached order logn vertices. 

Theorem 1.1. Consider the contact process, ^,t, on the Erdds-Renyi random graph Gi ~ Q{n,p) 
with np = n", npX = b > 1, and ^0 = {v} C Vi, and let r = 5^ log logn. Then there exist 
constants 773, e, c > depending on b so that for any S > 



The constants 773 and e in Theorem 1.1 are defined in Lemma 3.2, and the proof appears 
thereafter. We use an approach similar to the one employed by Peterson [11] to prove expo- 
nential survival of the supercritical contact process on the complete graph with random vertex 
weights. That is, we show that if the size of the contact process initially exceeds 772 for some 
7 > e, then in a small constant amount of time, the size of the contact process is likely (with 
probability exponentially close to 1) to have increased by the end of the time interval without 
ever having dropped below size en. The main difference between our proof and Peterson's is 
that wc mTist rely on an isoperimetric inequality (Lemma 2.2) to guarantee that the contact 
process has room to expand. 

Our main question is, if the infection is started at a single vertex vq &Vi, and the infection 
is able to spread, how long will it be before a positive fraction of the vertices in V2 are infected? 
In other words, how long does the infection take to cross a bridge between the two populations 
and spread throughout the second population? Theorem 1.2 answers this question. 

Theorem 1.2. Fix a, G (0,1]. Suppose np = n", npX = b > 1, and there are \B\ = 
o(n"/ logn log logn) bridges between V\ and V2. Choose any vq G Vi. There exists e > 
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so that if T := inf{t > : fl V2I > ^n}, then for any x G [0, 00) and any S > 
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This says that when the infection survives long enough (which happens with probabihty 
~ 1 — 1/b), the distribution of r, the time at which the infection has spread to a positive fraction 
of vertices in V2, is approximately exponential with rate \B\ (6—1)^/ {brf"). The upper bound on 
the number of bridges that we can accommodate is because our proof requires that the amount 
of time between the first log n successive potential transmissions of the infection between the two 
communities is at least O (log log n), and the rate at which such potential transmissions occur 
is \B\\ = b\B\ /n'^. The maximum number of bridges allowable to guarantee the separation of 
timescales seen in Figure 1 should be Oin"") as the following mean- field argument demonstrates. 
For small times t, the number of infected vertices in V\ at time t is approximately fe*, since each 
infected vertex has expected degree np and spreads the infection to each neighbor at rate A. 
Therefore, the number of infected vertices in Vi will reach en at time s = log (en)/ log 6. The 
expected rate at which the infection is transmitted from Vi to V2 at time t is approximately 
6*A \B\ jn = 6*"*"^ \B\ /n^"*"", so the expected number of times the infection is transmitted from 
V\ to V2 before time s is 



, 6*+MB|log6 |B|e61og6 

— 5- — at = — 



n- ■ - n^+°- 

So, if \B\ S> n" then the infection is likely to spread to V2 before reaching size en in Vi. It is 
worthwhile to note that if Vi and V2 are two halves of a homogeneous Erdos-Renyi random graph 
with mean degree n" then |-B| ~ n^"*"", so no separation of timescales should be observable. 

Theorem 1.2 generalizes easily to A'^ communities, Gi, . . . ,Gn, each independent and dis- 
tributed as Q{n,p) with np = n" {N is fixed). If Bij is the (possibly empty) set of bridge edges 
between communities i and j, and Bi = UjBij, then we assume maxj \Bi\ = o{n"'/ log n log log n) 
and for all 1 < i,j,k,£ < N there are constants c, C > such that whenever \Bij\ > and 
\Bke\ > then 

c<§4<a 

Under this assumption, all pairs of communities have either a comparable number of bridges, or 
no bridges (are not directly connected), so we let Pij = \Bij \ / max^^^ \Bki\- We can then define 
a process xt € {0; 1}"^ such that Xt{i) = 1 if and only if community i has at least en infected 
vertices at time tn^/ max^ ^ \Bke\- If npX = b > 1 and xo = (!> 0, 0, . . . , 0), then Xt converges 
in distribution on a finite time interval to the monotone stochastic process in which Xt(i) flips 
from to 1 at rate J2j Xt{j)Pij(,b— l)^/b. That is, at the community level, the infection process 
resembles an SI epidemic with inhomogcneous infection rates. 

The assumption that the number of bridges between connected communities is of a single 
order of magnitude is not necessary, except that things become a bit more complicated when the 
number of bridges between communities can span multiple orders of magnitude. In particular, 
we need to consider multiple time scales. The time for the infection to pass from community i 
to community j is proportional to the 'length' of the shortest path between the two, where a 
path from i to j is a sequence of communities i = xo,xi, . . . ,Xk = j such that > 0, 
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the 'length' of this path is 1/ min^ and the shortest path is the one with the minimal 

'length'. This is because most time will be spent while the infection attempts to cross the 
narrowest bottleneck. 



1.2. Proof Outline. For simplicity, we show the proof for the case where there is a single 
bridge edge, B = {{ub,Vb)} with Ub G Vi and vi, G V2. The generalization to multiple bridges is 
straightforward, and we comment on this briefly in Section 5. 

Step 1. When the contact process starts from a single vertex, ^0 = {^^o} C Gi, for a small 
amount of time, r = O (log log n), \^t\ is well approximated by a continuous time branching 
process with survival probability 1 — 1/5, where b = npX (Lemmas 3.1 and 3.2). By time r, 
the contact process has either died out, or is destined to survive for exponentially long with 
positive density by Theorem 1.1. The main difficulty in this part of the proof is managing 
'collision' events, where an infected vertex attempts to spread the infection to another already 
infected vertex. Once the process grows beyond size n", these collision events potentially stifle 
the growth of S,t- We overcome this obstacle by the use of the isoperimetric bound given in 
Lemma 2.2. This bound says that as long as the contact process contains at most a small 
fraction of vertices in Vi, then the number of edges between and V\£,t will be large. We use 
this bound to show that while \^t\ is not too large, it stochastically dominates a random walk 
with positive drift. 

Step 2. We use the self-duality of the contact process, which essentially means that the 
time reversal of the contact process has the same distribution as the contact process - we will 
carefully describe duality in Section 4. The dual process started at time t > 0, {Ct}s€[o,t]j is 
constructed so that if it is started from a single vertex Cq = {v} C V, then H ^t-s 7^ for 
some s G (0, i) if and only if v & In the graphical construction of the contact process, 
edges attempt to transmit the infection at rate A independent of ^f, so we observe the dual 
process started from Ub when the bridge edge {ub,Vb) attempts to transmit the infection. The 
dual process will reach size Q{logn) by time r with probability close to 1 — 1/6. The primary 
difficulty at this point is getting the dual process, which has grown backwards in time, to 
intersect with the contact process, which has size at least en {G is not the complete graph, so 
it is possible for all of the vertices in the contact process to be far from the vertices in the dual 
process). We do this by coupling the particles in the dual process with a random walk process 
in which each particle jumps according to an independent simple random walk on the vertices 
of G and dies at rate 1. Those particles which survive for time t^^ix = O (log log n) and do not 
collide with other particles will all be well mixed (Lemmas 2.5 and 4.1), and each intersects 
with the contact process with probability larger than e, which implies that the two processes 
intersect with high probability. 

Step 3. At the times when the bridge edge (ub, Vb) attempts to transmit the infection (a 
Poisson process {T^""'"'^}fcwith rate A) we start an independent dual process. Each dual process 
to survive to time r will intersect with and result in the infection spreading to Vb- In turn, Vb 
will spread the infection to V2 with probability approaching 1 — 1/6 by repeating the first step 
of the proof. Therefore, the number of times that the bridge edge must attempt to transmit the 
infection approaches a Geometric([l — 1/6]^) distribution, and the interarrival times between 
successive attempts are distributed as independent Exp(A). So on the event that / 0, the 
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time required to spread the infection to V2 approaches an Exp(A[l — 1/6]^) distribution, and 
the probabiUty that / approaches 1 — 1/6; this is the statement in Theorem 1.2. 

Section 2 is devoted to proving two key lemmas about Erdos-Renyi random graphs with high 
degree - an isoperimetric bound and a strong mixing time estimate. In Section 3 we compare 
the early stages of the contact process with a branching process and prove Theorem 1.1. In 
Section 4 we prove a key lemma about the dual of the contact process, and in Section 5 we 
bring everything together for the proof of Theorem 1.2. The Appendix contains some basic 
properties of branching processes that are used in Section 3. 

Acknowledgements. Thank you to Rick Durrett for numerous helpful conversations dur- 
ing the writing of this paper, and for his comments on the final draft. Thank you to John 
McSweeney and Bruce Rogers for performing the original simulations of this model. This work 
was started at SAMSI during the 2010-2011 Program on Complex Networks. 



2. Isoperimetric Inequality and Mixing Time Bounds 

In this section we prove Lemma 2.2, which gives a bound on the e-isoperimetric number 
(defined below) of an Erdos-Renyi random graph with large mean degree, and Lemma 2.5, which 
says that the random walk on the high-degree random graph is almost uniformly distributed 
after a constant number of steps. These two properties of the random graph, which is fixed for 
all time, hold with P-probability tending to 1 as n —t- 00, so when we later consider the contact 
process on a random graph, we can condition on it having these properties. 

Definition 2.1. Define the e-isoperimetric number, ie{G), of a graph, G = {V,E) as 



it{G) = min 



I \u\ 



U CV, \U\ <e\V\ 



where dU G E is the set of edges with exactly one end vertex in U. 

Lemma 2.2. IfG^ Gin,p) is an Erdos-Renyi random graph on n vertices with edge probability 
p such that up > 28(logn)^, then for any fixed e > 

i,{G) > {I - e)np - {npf^ 

with probability 1 — o(l) os n —)• 00. 

Note that this bound is essentially tight since if U is any deterministic set of en vertices, then 
\dU\ ~ Binomial(|C/| (1 — e)n,p) so \dU\ = \U\ (1 — e)np{l + o(l)) with high probability. The 
proof is adapted from the proof of a similar result for the ^-isoperimetric number of random 
regular graphs in [2]. 

Proof. Denote by P{u,m) the probability of the event that G contains a set of vertices U cV 
with \U\ = u and \dU\ < m. We will have proven the lemma once we show that 

en 

^P(u,m(u)) = o(l) 

u=l 
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where m{u) := u [(1 — e)np — (np)^/^] . By Markov's inequality 
P{u,m{u))< s )P ^^-P^ 



s=0 



^. •iXu<m{u)) 

where ~ Binomial(u(n — u),p). Bernstein's inequality, as it appears in [12], says that 

V {\Xu-^Xu\ > t) < 2exp 



2(VarX„ + f/3) 

Let t = — m{u) = u{enp — up+ (rap)^/^). Note that t > u{np)^/^, since u < en, so 

P {Xu < m{u)) < P (^\Xu - EX^I > n(np)2/3^ 

-^^(np)V3 1 

2 [u{n - u)p{l -p) + «(np)2/3/3] J 

2(np)V3 + 2/3j 
<2exp|-^u(np)i/3|, 

provided > 1/3. Applying the assumption that np > 28(logn)^ completes the proof: 

en oo 

^P(u,m(«)) < 2^ [ne-("f)'^'/=^]" 
«=i 

2ne-(-^')^/V3 (^1 _ ne-^npy^'/'Y' = o{l). 




u=l u=l 



□ 



We will make use of a random walk on a random graph in the proof of Theorem 1.2, and 
we will need a bound on the mixing time of this random walk. Loosely speaking, this is the 
number of steps that it takes for the random walk to be 'close' to its stationary state. Let 

Wfj- - i^wtv ■=ly2 ~ = yi(^(^) ~ ^(^))+ = sup im(^) - 

^ vev vev 
denote the total variation distance between two probability measures, /x and v, on the vertices 
of a graph G = {V, E). 

Let Xk be a discrctc-timc, simple random walk on the vertices of G ~ Q{n,p) where np = n" 
and a € (0, 1]. We denote the fc-step transition probabilities of Xj. by P^{u,v) for u,v ^V. 
Also, we will denote the probability measure on V that corresponds to the k^^ step of the 
random walk started at u by P^{u, •). Let tt be the stationary distribution for this random 
walk whenever it is uniquely defined. Note that the random walk is ergodic (so vr is unique) 
asymptotically almost surely because X^ is aperiodic [G is not bipartite) and irreducible {G is 
connected) with probability superpolynomially close to 1. 
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Typically, the mixing time for a random walk, X^, is defined to be the smallest k such that 



TT 



< a for some fixed a € (0, 1/2). In the proof of Lemma 4.1 we need 



tight control on P^{u, •) for many vertices, u, simultaneously, so our goal is to show that for 
some constant /cmix depending only on a. 



sup 

uev 



n 



< \A\o{n-^) 



for every A C V. To say this we need the following lemma, which says that the stationary 
distribution of is almost uniform. 

Lemma 2.3. Let G ^ Q{n,p) with np = n"", a G (0, 1], and let tt be the stationary distribution 
for the simple random walk on G (conditional on its existence and uniqueness). If is the 
uniform probability distribution on G (^{v) = n^^ for all v £ V), then 

\Tr{A) - fi{A)\ < 3n-(^+'^/3) |^| 
for all A gV with probability 1 — exp 

Proof. It is easily verified that 7r(f) = deg(t;)/(2 |£'|) for all v € V. By Bernstein's inequal- 
ity [12], deg(t;) G [n" - r?'^!"^ ,n"- + r?°-l'^\ for all G F with probability 1 — exp[— r2(ra"/^)]. This 
implies that, for all ^ C y and all sufficiently large n (> 2^/"), 



7r(^) < 



< 



n 

n 



1 + 

1 - n-"/3 
1 + 3n-"/^ 



with probability 1 — exp[— Likewise, we have the corresponding lower bound i^i^A) > 
{\A\ /n)[l — 3n~"/^] for all A with high probability, which proves the lemma, since IJ,{A) = 
\A\ In. □ 

Note that Lemma 2.3 is slightly stronger than the statement that 

Ik - \x\\j,y < 3n""/^ 

with probability 1 — exp[— f2(n'^/^)]. 

In [4] it is proved that when G ~ Q{n, dj {n — 1)), d G {1, . . . , n — 1}, and k>Q and v 
are fixed, then 



E 



P\v,.)-y. 



<cl- 



for some absolute constant c. (Note that /i, the uniform probability distribution, appears in 
the statement, and not tt.) The problem is that this result does not provide a bound on the 
mixing time, which entails taking the supremum over all initial locations for the random walk. 
This presents a problem for us, since we want to say that many independent random walks, 
started from different locations, will all be well mixed at the same time. To remedy this for 
high degree random graphs, we have the next lemma. 



8 



Lemma 2.4. Let G ~ Q{n,p) with np = n" for a G (0, 1]. If k = [1 + 1/aJ, then 

sup \\P'^{v, •) - ttII^.^ = 0(n-"/=^) 

TOt/i probability 1 — o(l). 

The definition of k in Lemma 2.4 is such that k is the smallest integer strictly larger than 
1/a. 

Proof. We will employ a simple path counting argument similar to an argument used by Lu- 
betzky and Sly [8] to prove cutoff for the random walk on random regular graphs. Let Vkiu, v) 
denote the number of paths in G of length k that start at u and end at v. As in the proof of 
Lemma 2.3, we can use Bernstein's inequality to bound the maximum and minimum degrees 
of G as deg(i;) G [n"' - v?"-/^,n°- + n^"/^] for all v £ V with probability 1 - exp[-0(n"/^)]. 
Observe that the probability that the random walk, X^, traverses any path of length k, 
(vo, vi), {vi,V2), {vk-i,Vk) e E, is 

P\vo,v,) ■ P\vi,V2) ■ ■■P\vk-uvk) > ^(1 - 0(fcn-"/3)) 

with probability 1 — exp[— f2(n"/^)]. Therefore, 

PH'u,v) > rk{n,v)^{l - 0(A;n-"/3)) 

for any u,v G V with high probability. To obtain a lower bound on Vk{u, v), we introduce the 
following notation for balls of radius k in G. For any u eV and A; > 0, let 

Bkiu) := {v eV \ dist{u,v) < k} 
dBk{u) ■.= Bk{u)\Bk-i{u) 

where dist(u, v) denotes the length of the shortest path in G from u to v, and dBo{u) := {u}. 

For a pair of vertices u,v V, let ku = \^^^~\ and k^ = |_^^^J , so that ku + ky = k — I. Wc 
will construct a ball of radius ku around u, then remove this ball from V and construct a second 
ball around v of radius ky whenever v ^ Bk^{u). Since most vertices in V are not within ku of 
u, we will have two disjoint sets of vertices for most pairs u, v. The number of paths between 
u and v in G is at least the number of edges between the vertex boundaries of these two balls, 
so we seek a uniform lower bound on this quantity. 

To obtain bounds on \dBk^{u)\^ we start at u and reveal edges layer by layer. To start, by 
Bernstein's inequality, 

_ „2a/3 < \dBi{u)\ < + r?"/^ 

with probability 1 — exp[— r2(n"/^)]. By induction on A;, we wish to show that | \dBk{u) \ — n"'^\ = 
Q(^^ak-a/3^ for all k < ku- Assume that 

and 

\Bk-i{u)\ < n('=-^)»(l + 0(n-»/^)). 
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Since k < ku, we have that (k — l)a < 1/2, so Bk-i{u)\ = 
there are at least n('^~^)"+-'^(l - 0(n^"/'^)) and at most 



n — 0(n^/^). This means that 
^(1 + 0(n""/^)) potential edges 



between dBk-i{u) and V \ Bk-i{u 

\dBk(u)\ > n(*-^)"+V(l - 0(n-«/3)) 



Therefore, by Bernstein's inequality. 

2/3 



(fc-l)a+l 



(1 - 0(n-"/3) 



with probability 1 — exp[— $7(71*^"/^)], and likewise 

|aSfc(n)| < n'="(l + 0(n-«/3)) 

with probability 1 — exp[— Q(n'^"/^)]. It immediate follows that \Bk(u)\ < n^"(l + 0(n^"/^)), 
which concludes the induction argument. Therefore, | — n*^""! = 0(71*^""""/^) with 

probability 1 — exp[— r2(n"/^)]. 

After exposing B^^ (u) we can employ the same argument starting from v & V \ B^^ (u) , but 
at each step avoiding the vertices in B^^ (u) . Let 

BUv) ■.= {w£V\ BkSn) I ^isiu{v,w) < k} 

dBUv) ■.= B^iv)\Bt_,iv), 

where distu(i',w) denotes the graph distance on the maximal subgraph of G with vertex set 

V \ Bk^{u). Note that when a > 1/2, ky = 0, so B'^^{v) = {v}. When a < 1/2, removing the 

vertices in Bk^{u) from V does not affect any of the estimates made above, because \Bk^(u)\ = 
0(n'^"") = 0(n(i+")/2) ^ 0(ni-"/3), so that |F \ (Sfcjn) U B|f_i(t;))| = n(l - 0(n-"/3)) at the 
induction step. Therefore, (-f;)! — n^^"! = ©(n'^""""/^) with probability l—exp[—ri(n"/^)]. 

Now we observe that every edge between the sets dBk^{u) and dB'^^{v) contributes at least 

onepathof lengthKto7'„('u,'y). There are at least \dBk^{u)\-\dBl^{v)\ > n('=-^)"(l-0(n-"/^)) 
potential edges between the two sets. So by again applying Bernstein's inequality we have that 

V^{u,v) > n'^"-\l - 0(n-"/^)) = a;(l) 

with probability 1 — exp[— r2(n"/^)], and therefore that 

with probability 1 — exp[— r2(n"/^)]. By applying a union bound over all pairs u ^ V and 

V gV \ Bk^{u), this bound on holds for all such v with probability 1 — o(l). Therefore 

sup \\n - P^iu, OIItf = sup V ( - - P''{u, v)] 

with probability 1 — o(l). Applying Lemma 2.3 and the triangle inequality completes the 
proof. □ 
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An immediate consequence of Lemmas 2.3 and 2.4 is the following lemma, which we will use 
in the proof of Theorem 1.2. 

Lemma 2.5. Let G ~ Q{n,p) with np = for a G (0, 1], and P^{u, ■) he probability measure 
on V corresponding to the /c*'' step of the simple random walk on G started at u. Let tt be the 
stationary distribution of the random walk, and fx be the uniform probability measure onV. If 
kmix = 12 [l/al^ then 



sup 
u&V 



pkmixty^ •) - TT = 0(n ^) 



with probability 1 — o(l). Furthermore, 



sup 
uev 



< An' 



-(l+a/3) 



1^1 



for all A (ZV with probability 1 — o(l). 



Proof. The first equation is due to the following fact, which is a consequence of Lemma 4.11 
and equation (4.31) in [6]. If sup„gy •) — 7r|| < a for some A; G N and a G (0, 1/2), then 

for any integer £ > 0, 



sup 

uev 



In our case, by Lemma 2.4, a = Const • n k = k <2\l/a\ and i = 6 |"l/a]. The second 
equation follows from the first and Lemma 2.3. □ 



3. Survival of the Infection 

In this section we will show that the contact process on an Erdos-Renyi random graph 
survives for exponentially long whenever it docs not die out very quickly. We will do this 
by making use of three comparisons with branching processes. We will use lower and upper 
bounding branching processes to carefully control the behavior of the contact process started 
from a single vertex in its early stages. Then we will use a second lower bounding branching 
process to show that the contact process will survive to linear size whenever it survives beyond 
its initial growth stage. The relevant facts about these branching processes can be found in 
the Appendix. Finally, once the contact process occupies a positive fraction of the vertices, we 
will compare it to a random walk with positive drift to show that it will continue to occupy a 
positive fraction of vertices for exponentially long. 



Lemma 3.1. Let r = t——, , 

3/ log 6. If the contact process starts with a single vertex, = {v}, then 



riTs loglogn = g^loglogn + o{l), r/i = and r?2 = 



(1) 
(2) 
(3) 

with -probability 1 — o(l). 



sup l^tl > 2r/i(logn) 

t<r 



0{n 



\ (0 < ICrl < ??2logn) = 0((logn)-i), 
n(l^r-|=0) = ^ + O((logn)-i). 
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Proof. By Bernstein's inequality [12], the degree of any vertex in Gi or G2 is in the interval 
[np — {npf'^^ ,np + (np)^/^] with P-probability 1 — exp[— r2((np)^/'^)]. For eaeh n, let 1^" be 
a branching process in which each individual gives birth to a single offspring at rate /3y = 
b — 2(np)~^/^ and each individual dies at rate 1. Likewise, let Z^ be a branching process in 
which each individual gives birth to a single offspring at rate = 6 + {np)~^^^ and each 
individual dies at rate 1. It is clear that stochastically dominates |^(| restricted to the graph 
Gi (ignoring edges between Gi and G2) for all time provided Zq = |4o|- By Lemma A. 4, when 

2 1 1 I fih 6/35 

n = ^ loglogn and 771 = (^riyi^ > (^|3T)t^' 



sup 161 > r?i(logn)* < Pi sup > m(logn)* = 0(n-^). 

t<ri J \t<ri J 

We want to extend this bound slightly to time r = pr?_i log log n. To do so, we observe 

that r = ri + 3(np)~^/^ loglogn + o{{np)^^^'^). The number of birth events during a time 
interval of length r — ri for the process Z^, when < 2r/i(logn)^ throughout the interval, 
is stochastically bounded above by a Poisson(4677i (log n)'*(r — ri)) random variable for large n 
(the leading coefficient of 4, rather than 2, over-compensates for the difference in rates between 
and b). Therefore, the probability of there being more than 7 births during this interval is 
at most 0(n~'^). If at the start of the time interval < r]i{logn)^, then the branching process 
cannot have exceeded size 2?7i(logri)'^ at any time during the interval, so we have equation (1). 

This means that with high probability, at any time t G [0, r] and for any vertex v £ Vi, we 
have that \J\f{v) n 6| < 2r7i(logn)^ < (np)^/^ for sufficiently large n. Therefore, for all t G [0, r], 
\Af{v) \^t\ > np - 2(np)^/^, and |6| stochastically dominates Y^". If r?2 = 3/log6 < 3/log/3y, 
then by Lemma A. 2 and equation (1) 

(161 < m'^ogn) < Pi {Y;" < ?72logn) + 0{n~'^) 

= Pi (y," = 0) + O((logn)-i) 

^ 6-2(n,)-V3 +Q(aognr^) 
= ^ + 0((logn)-i). 

The third line above follows because the extinction probability for is l//3y, and the last line 
follows by expanding the geometric scries for the first term. Likewise, we have 

K (161 < m logn) > Pi (Z;* = 0) + P, (0 < 161 < V2 logn) 
> Pi (z,-^ = 0) + P^O < 161 < m logn) 
1 _ e-(/'z-i)'^i 



^„ _ ^-(/3B-l)ri 



+ P40< 161 <??2 logn) 
^ - 0((logn)-2) +P^ (0 < 161 < r/2logn) . 



The second line follows because the events {Zt = 0} are increasing in t, and the third line is 
an exact compTitation of the transition probability. Combining the last two inequalities (and 
equation (1) to guarantee that 6 dominates Yj") completes the proof. □ 
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By Lemma 2.2, the e-isoperimetric number of Gj for j = 1,2 is bounded below as 

i2e{Gj)>{l-2e)np-{npf/^ 
for asymptotically almost every Gj. We choose e = ^(l — l/h^^'^) > so that 

(4) (1 - 2e - (np)-^/3 - 0{{\ogn)^{np)-^))h > (1 - 3e)6 > 1 
for sufficiently large n. Then Lemma 2.2 applied to i2e{Gj) implies that 

(5) |Ar(6)\6l>(l-3e)np|6| 

for large n whenever < 2en for asymptotically almost every Gj. The term 0((logn)'^(np)^^) 
appears in (4) because in the proof of Theorem 1.2 it will be necessary to avoid as many as 
0((logn)^) vertices that have been observed by the dual process defined in the next section. 
Since (5) is a property of the graph, which is fixed for all time, we will assume that this inequality 
holds for the remainder of the proof. This means that during the time interval [r, T2(\ , where 

T2, = M{t > r : \^t\ > 2en}, 

\^t\ stochastically dominates a branching process with per-capita birth rate (1 — 3e)b > 1 and 
death rate 1. Denote this branching process by Wt. Then 

P {T2e > s I logn < l^rl < r?i(logn)'') < P (W, < 2en \ Wr = m logn) 

<P(W, < 2en I Wr = Ij^^iog'*, 

because the event that Ws < 2en given that Wr = rj2 log n implies that all of the r]2 log n families 
at time r must not have exceeded size 2en by time s. If we let s = (i_3^)fc_i logra + r, then 
Lemma A. 2 implies that 

(1 \ '72 log n 

< „-3-3iog,(i-3e) exp[0(n-2 logn)] 
= 0(n-2). 

In the second line above we used the bound 1 + x < , and in the last line we used the fact that 
e = j(l — so — 31ogb(l — 3e) < 1. Together with Lemma 3.1, the last two inequalities 

imply the next lemma. 

Lemma 3.2. With r defined as in Lemma 3.1, e = \{1 — and rjs = (i_4g){,_i then 

(6) P, (T2e > ??3logn I iCrl > 0) = ©((logn)"!). 

This means that if the contact process is able to survive to time r, which happens with 
probability 1/6 + 0((logn)~^), then it will reach size 2en before time 773 logn. Theorem 1.1 

shows that if the contact process, ^t, survives to time r, then it will continue to survive for 
exponentially long. In particular, it will survive long enough to spread to the second component. 

Theorem 1.1. Consider the contact process, ^t, on the Erdos-Renyi random graph Gi ~ 
Q{n,p) with np = n", npX = b and ^0 = {v} C Vi, and let r = log logn. Then for % > 
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and e > as in Lemma 3.2 and a constant c > depending on b, for any S > 



■ V 



min < en 

te[T?3 log 71,6="] 



\&\ >0\>S 



0. 



For the proof of Theorem 1.2, we wiU apply Theorem 1.1, but with as many as 0((logn)^) 
vertices removed from Gi at any time during the process. This added restriction has no effect 
on the proof of Theorem 1.1. Our approach is similar to that of [11] for the contact process on 
the complete graph with random edge weights. 

Proof. We intend to show that there exist constants 6 G (e, 2e] and r, C > so that 

l^ol =x]>l- e-^", 



(7) inf F l^rl > Sn, min > en 

x>Sn \ te[o,T] 

for all sufficiently large n. This means that with probability exponentially close to 1, if the size 

of the contact process initially exceeds 6n, then at time r the size of the contact process will 
again exceed 5n and will not have dropped below en along the way. By subdividing the time 
interval [0, e'^"/^] into e^"/^/r intervals of length r, this implies that 

leoi = x]< 



(8) sup P min < en 

x>&n V«6[0,e'^"/2] 

By Lemma 3.2, T^t < rj^logn with probability 1 — o(l) conditional on l^^l > 0. The Strong 
Markov Property and equation (8) imply the result with c = C/2. 

To prove equation (7), first we observe that by monotonicity of the contact process, it suffices 
to prove the statement with initial size x = Sn. We will actually prove the stronger statement 
that for 7 = min{e(l - 4e)6, 2e} (recaU that (1 - 4e)6 = 6^/^ > 1), there exist r, C > and 
S G (e, 7) such that 



(9) P (l^rl > 6n, 161 G [en, 7n] G [0, r] | l^ol = <5n) > 1 - e' 



-Cn 



The difference between the events in equations (7) and (9) is that in the latter the size of the 
contact process is also not allowed to exceed size 7n. 

The total jump rate of \^t\ is at most 0(n) for all t. So, for sufficiently small r > (depending 
on b), the probability that the number of jumps in the time interval [0,r] exceeds \{'J — e)n 
is at most e^*-"'" for some C > depending on r and b. If we choose 6 = ^{e + then 
l^ol = S'n implies that |6| G [en, 7n] for alH < r with probability exceeding 1 — e"*-^'". While 
161 G [en,7n], the maximum rate at which |6| — >■ |6| — 1 is 7n < e(l — 4e)6n < e(l — 3e)6n, 
which is the minimum rate at which |6| \S,t\ + 1 in this interval. Therefore, given that 
161 € [en, jn] for all t < t, |6| stochastically dominates a random walk with positive drift up to 
time T. Specifically, |6| stochastically dominates Xt, where Xt is the continuous time random 
walk that jumps to X( + 1 at rate e(l — 3e)bn and to — 1 at rate e(l — 4e)bn. By a standard 
large deviations argument for random walks, P^^ {Xj- < 5n) < where C" > depends 

on r and b. Choosing C < min{C",C"}, we have demonstrated equation (9) for all sufficiently 
large n, and thus completed the proof of Proposition 1.1. □ 
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4. Duality 



It is well known that the contact process is self-dual in the following sense. For any two sets 
of vertices A,BcV and t > 

(10) P (Ct^ n B / 0) = P (Cf n ^ 7^ 0) . 

For our purposes, the way to understand the contact process duality is through the graphical 
representation of the process (see, for example, Part I of [7], which we paraphrase here). To each 
vertex v e V we assign a rate 1 Poisson process with jump times {T^}^^, and to each ordered 
pair of vertices {u, v) joined by an edge in G {{u, v} € E) we assign a rate A Poisson process 

with jump times {T"^"'^^}^!- All of these Poisson processes are independent of one another. To 
construct the contact process graphically, we begin by drawing the space-time axes G x [0, oo). 
For each A; G N and v e V, we draw a recovery dot, •, at the point (vjTJ^) G G x [0, oo). For 
each k gN and ordered pair {u,v) such that {u,v} G E, we draw an infection arrow, — >■, from 

(«,r,("'''))to (t;,r^^)). 

We say that there is an active path from {vq, ti) to {v^, t2), with ti < t2, if there is a sequence 
of arrows between vq and V£, 

h < r(;°'^^) < r(^^^'^^) < • • • < tI^^-^'^'^ < t2, 

such that there are no recovery dots encountered along the way, 

{^r e [T^;'"'''''\ T^^;';f +i))} = = 1, . . . , ^ - 1 
{Tr e [ti,Ti:"'^^^)} = {t;:^ g [Tt;-'''"\t2]} = 0. 

Wc now have that v G if and only if there is a vertex u G such that there is an active 
path from (u, 0) to {v,t). Therefore, by tracing the arrows in reverse, we can determine for 
each V eV whether v e Because the Poisson processes determining arrows in each direction 
between pairs of adjacent vertices are iid, we have that the sets 

{u eV \ 3v e A s.t. there is an active path from {u, 0) to {v, t)} and 

{vEiV\3uGA s.t. there is an active path from (w,0) to {v,t)} 

are equal in distribution, which is equivalent to the self duality of the contact process as stated 
in equation (10). The dual process of the contact process can thus be viewed as a reversal of 
the arrows in the graphical representation. 

Let {ub,Vb} e E he the bridge edge between Vi and V2, such that Ub G Vi and Vb G V2. We 
will use the graphical representation of the dual of the contact process to determine whether the 
vertex Ub is infected at the times T^^'^^\ The idea is that by Theorem 1.1 there are at least en 
vertices infected in V\ at the times T^^'^^^ — r — tmix (where tmix = O(loglogra) will be defined 
later). By Lemma 3.1, the probability that the dual process started at {ub,T^^'^^^) survives to 

time T^^'""^"^ — r is approximately 1 — 1/6. If the dual process does survive, then by coupling 

the dual process with a random walk process, we will show that in time tmix (going backwards 
in time still) at least one of the active vertices in the dual process will coincide with one of 
the en infected vertices in the contact process at time J'^"'"^''^ — r — tmix with high probability. 
Therefore, Ub will be infected at time which will immediately result in Vb becoming 
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infected. In turn, starts a widespread infection in V2 with probabihty close to 1 — 1/6 by 
Lemma 3.1 and Theorem 1.1. 

To begin, we let Ct be the dual process on d started at {ub, tI"""""^). Note that {r^'"'"''''^}*; are 

random times, but they are independent of {7^"'"'}*; and {T^}k for all ordered edges {u, v) and 
vertices v uiG\. The interpretation of C,^ is that U}, G ^ (u,,,vf,) if and only if fl ^ {uf^,v0 7^ 0. 

k k ^ 

Also note that {C,^ n S 7^ 0) = P„j (^t n B 7^ 0) for all B dVihy the duality equation (10). 

Therefore we can apply Lemma 3.1 to C,^ to say that \ > with probability differing from 
(1 — 1/6) by at most ©((logn)^"*^), and when |^^'| > 0, > ?72logn with probability 1 — 
0((logn)~^). The purpose of the next lemma is to show, via a coupling with random walks, 
that whenever |Cr | > V^^ogn, we have Ub G ^ (u^,vt,) with high probability. 

k 

Lemma 4.1. If (j^ is the dual of the contact process started at (-Ub, T^"*"^'^), e > and tmix = 
log log n then for all 6 > 



sup (c'+t_ n ^ = Cr > ^2 log n) > 5 

AcVi,\A\>en ^ ' 



0. 



For the proof of Lemma 4.1 we will first need to construct the coupling between and a ran- 
dom walk process, then prove some facts about this process. Let = {X/, . . . , '°g"-| 
be the locations of 772 log ?^ independent, continuous time, simple random walks on G\ that 
independently die at rate 1. That is, for each i, X\ is the continuous time random walk on G\ 
that holds at a vertex « G Vi for time Exp(Adeg(w)), then jumps to ~ tt with probability 
1/ deg(u), and which has a total life span distributed as Exp(l). When a walker dies, we remove 
it from the set Xj. 

We can couple X^ with Cr+t until t = T^ie A Tcoiiidej where T^ie = inf{t : X^ = 0} (all of the 
random walkers die out), and Tcoiudc = inf{t : XI = Xf for some i 7^ j} (two random walks 
collide). First, let Xq be the i*'^ clement of the lexicographical ordering of the vertices in 
We then update X^ according to the infection arrows by always following the first arrow to 
arrive in the correct orientation (going backwards in time, so this is actually the most recent 
arrow infecting the current vertex at which the walker resides). That is, if XI is at the vertex u 
at time t, then X^ will jump to v at (random walk) time T^*"^^^ — r — T^'"'^^ if the arrow from 
u to u is the next encountered: 

t)^'") = maxmaxlrf | '"^ < t!:'""'"'^ -r-t}. 

If, while at the vertex u, XI encounters a recovery dot before an arrow into u (there is a j so 
that T^^'^^^ — T — t > TJ" > Tg"'^^) then the random walk dies. It is clear that this constructs the 
random walk XI as described above, since the waiting time until the first infection arrow into 
the vertex u is distributed as the minimum of deg(M) random variables with Exp(A) distribution, 
which is Exp(Adeg('u)), and the recovery dots appear at rate 1. 

Under this coupling Xt C Cr+t- When a collision occurs between two random walkers, 
however, our coupling would cause those walks to stick together for all time. To avoid this, at 
the time of the first collision, TcoUide; we stop the coupling between Xj and Cr+t^ and instead let 
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each of the random walks proceed independently (of the other random walks and of Cr+t)- This 
way, at time tmix the locations of the surviving random walks are independent, and as long as 
^mix < ^collide they are still coupled with Cr+t- 

The proof of Lemma 4.1 has three basic ingredients. First, many of the random walkers will 
survive for time tmix- Second, most of the random walkers will make at least /Cmix jumps by 
time tmix, so by Lemma 2.5 their locations will be almost uniformly distributed, so intersection 
with A is imminent. Finally, a collision is unlikely to occur before time tmix, so the random 
walk process will still be coupled with Cr+t- 

Proof. We assume that >V2 logn, A cVi with |^| > en is fixed, and we have the coupling 
described above between Xj and Cr+t- 

First we observe that the random walkers are mutually independent, and so are their death 
clocks. Since each random walker dies at rate 1, the number of random walks that survive to 
time tmix = 5^ log logn is Binomial(772 logn, (logn) ~^/(''+^)). So by Chebychev's inequality, 
the number of random walks that survive until time fmix is at least 5772 (log n)^^"^)/^^"'"^^ with 
probability 1 - 0((log n)-(^-i)/(^+i)). 

Next, we want all of the random walkers to make at least femix steps so we can apply 
Lemma 2.5. The probability that the i*^ walker, XI, jumps fewer than kmix times by time 
^mix (ignoring whether the walker survives to time tmix, as these events are independent) is at 
most 

/C<^/vinix 

= O ((logn)-2''/(''+i)(loglogn)'=--) 



since the minimum degree of Gi is at least n" — n^"/^ with high probability. Therefore, all 
of the surviving random walks will make at least /cmix jumps by time tmix with probability 
1 - 0((logn)-(^-i)/(''+i)(loglogn)'=--). 

Conditional on the events that Xf survives to time tmix and makes at least fcmix jumps, then 
the probability that Xl . G ^ is at least 

^(1 - 4n-«/=^) > e(l - 4n-"/^) 

for asymptotically almost every Gi by Lemma 2.5. Let E be the event that there are at least 

^?72(logn)^''~^^/*^*+^^ random walkers that survive to time tmix and make at least Ajmix jumps. 
Then the probability that none of the random walkers hits the set A is 

^ (^wx n ^ = 0) < P {Xt^,^ nA = m + P {E') 

< (1 _ ,(1 _ 4„-a/3))|r,2{logn)(''-i)/(''+i) + p (^c^ 

(11) = 0((logn)-(''-^)/(''+^)(loglogn)*^-='). 

Now we only need to check that the coupling between Xt and C^^^ has not been violated 
before time tmix- That is, we need to check that Tcoiusion > ^mix- Observe that TcoUision >- T' 
where T' ~ Exp (A (772 logn)^), since at any time there are fewer than \Xt\^ < (772 logn)^ directed 
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= 1 — exp 



-n " (log n)^ log log n 



edges connecting vertices occupied by the random walkers, and each directed edge has a rate A 
Poisson clock. Therefore, the probability that a collision occurs by time imix is 

^ (-^collision ^ ^mix) ^ ^ (T ^ ^mix) 

= 1 - exp [-tmixA(?72 logn)^] 
2bri 
b + 1 

(12) = 0(n"''(logn)^loglogn). 

Piecing together equations (11) and (12) shows that the coupling between ^^^^ and Xt will 
not be violated before time tmix, so Cr+t^ix ^ / ^ with probability 

1 - 0((logn)-(''-^)/(^+i)(loglogn)'=--) 

for asymptotically almost every Gi whenever |Cr | ^ 772 log n. □ 

5. Proof of Theorem 1.2 

Proof. First, we observe that by Lemma 3.1, P^q {\^r\ > 0) = 1 — | — o(l). Then by Proposi- 
tion 1.1, if \^r\ > then \^t\ > for all times t G [773 log n, e*^"] with high probability. Note 
that the number of vertices that are used up by the dual processes, {Ct}k, at any time is at 
most 0((logn)^) by Lemma 3.1 (when we observe the dual process, it is only allowed to grow 
for time r = O (log log n)). This is not a problem, since we have accounted for the fact that 
must avoid these vertices in equation (4). 

Now we consider the sequence of times at which the bridge edge may transmit the infection 
from Ub to Vb, {r^^"'''"''}fc. The interarrival times, Ti^""'""^ and tI'""'"'^ - rj^'^f') k>2, are 
independent with distribution Exp(A). If any of the first logn interarrival times are smaller 
than 2r+finix, then we will not have enough time to allow the dual process started at T^''^'"^'^ to 
be well mixed and guarantee that ^ (uf,,v0{ub) is essentially independent for each k. The reason 

*= 

for the 2r term is that once the infection reaches the second component, we need additional time 
r to see whether it survives in that component. Fortunately the probability that any of the first 
logn interarrival times is smaller than 2r + imix = O(loglogn) is at most ©(n^'* logn log logn). 

By Lemmas 3.1 and 4.1, the dual process C,^, started at space-time {ub,T^^'"^''^), will intersect 
with ^ with probability 1 — 5 — o(l). If successful, the result is that Ub is infected 

at time T^^'^^\ and therefore Vb G V2 becomes infected at this time. By applying Lemma 3.1 

and Theorem 1.1 to the infection in G2 started at Vb, this will lead to a wide-spread infection 
of G2 (with greater than en infected vertices in G2) by time T^^'"^''^ + rjs logn with probability 

1 — ^— 0(1). The first logn interarrival times of {r^"'"''''*}jt all exceed 2r-|-tinix = 0(log logn) with 
high probability, and conditional on this, the probability that the infection has not yet spread 



to the second component following the k^^ time T^^'^^^ {k < logn) is 1 — (l — ^ — o(l))^ 

The probability that N ~ Geometric^(l — | — o(l))^^ exceeds logn is 0{n~^^~^^'^~"^^^^^ ) , so 
we need not worry about the interarrival times exceeding 2r -|- imix- 



k 
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Therefore the distribution of the time 

T := M{t > : n F2I > en}, 

conditional on \^r\ > 0, is a sum of a Geometric ^(l — 5 — o(l))^^ number of independent 

Exp(A) random variables, which is Exp^(l — ^ — o(l))^ , plus some time O(logn) to account 
for waiting for the process to grow to size en in each subgraph. This completes the proof for a 
single bridge edge, since r is infinite when l^^l = 0, as P < 7^3 log n^ = 0(n'~" log n). 

The proof for the multiple-bridge case, \B\ = o(n"/ log n log log n), is nearly identical if we 
replace T^"'"''f) with Tf := min{T^^"'"''''^ : {ub,Vb) e B,Ub e Vi,Vb G V2}, the sequence of 
times at which some bridge edge attempts to transmit the infection. All that needs to be 
checked is that the first log n interaxrival times of consecutive transmission attempts are larger 
than 2r + tmix = O (log log n). This is straightforward to verify, and the details are left to the 
reader. □ 
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Appendix A. Branching Processes 

In this section we prove Lemmas A. 2 and A. 4 regarding the size of a supercritical, continuous- 
time branching process, Zt, in which each individual gives birth to a single offspring at rate 
/? > 1 and dies at rate 1. Let Pr (•) be the probability measure associated with this process, 
and for all i,j G Z>o and t > 0, let Pij{t) := Pr {Zt = j \ Zq = i) =: Pr^ {Zt = j). 

Lemma A.l. Let Zt he a branching process where each individual gives birth to a single off- 
spring at rate P > 1 and dies at rate 1. If Zq= [Slogan] =: a then 

Pro {Zt = eventually) < n~^. 
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Proof. li Zq = 1, then Pri {Zt = eventually) = [5]. If Zq = a then all of the families are 
mutually independent, so 



Pva {Zt = eventually) = < n"^. 

□ 



Lemma A. 2. Let Zt be a branching process where each individual gives birth to a single off- 
spring at rate j3 > 1 and dies at rate 1. When Zq = 1, if r = log log n and s = logn, 
then for any constants C, c > 

(13) Pri(0<Z^<Clogn) = O((logn)-^) 



(14) Pri \^Zr > ^(logn)3j = ©(n'^) 

(15) Pri {0<Zs< on) = ©(n'^) 
The proof uses the following bounds. 

Lemma A. 3. IfO<y<^ then 

(16) -y-y^< log(i-y) <-y-y- 

Proof. By using the series expansion: 

OO 

log(i-y) = -E|' 

k=l 

the upper bound in (16) is immediate by truncating after the second term in the series. Using 
that y < 1/2, the lower bound in (16) follows from: 

log(l - y) > -2/ - y (1 + y + + • • • ) 

>-y-|(2). 

□ 

We now prove Lemma A. 2. 

Proof. The transition probabilities for Zt can be computed exactly, as in Chapter V of [5] : 

1 _ e-(/5-i)* 

Pife(0 = [l-Pio(i)] [l-^(i)] V{t)'-' 
1 _ e-(/3-i)t 



v{t) 



1_ ie-(^-i)t- 



20 



So we have at time r = -g^j- log log n that 



C log n 

Pri(0<Z, <Clogn)= ^ Pife(r) 

fc=i 

(17) =[l-Pio(r)]- [l-r?(rf 

We apply Lemma A. 3 to obtain: 

log [l - e-^'^-^)^] = log [1 - (logn)-^] > -(logn)-^ - (logn) 



-4 



log 



1 



-(/3-l)r 



log 



1 - -^(logn) 2 



< -^(logn 



,-2 



whenever n > 5. Subtracting the second line from the first, then multiplying both sides by 
C log n gives 

logr/(r)^^°g" >-c(^l- (logn)-i - O ((logn)-^) . 

Combining equation (17) with this bound and the fact that 1 — e"^ < x for x > (this follows 
from the upper bound of Lemma A. 3) proves equation (13): 

Pri (0 < < Clogn) < 1 - exp [log7?(r)^^°s" 

<c(^l-^^ (logn)-i + O ((logn)-3) . 

Equation (15) is proved in the same way. To prove equation (14) we begin with 

2/3 



Pri Zr > 



P-1 



(logn)" 



fc>^(logn)3 

[1 - Ao(r)] •ry(r)^5^(^°s")'. 



Now applying Lemma A. 3 in the same way as above (but with the upper and lower bounds 
reversed) yields: 



Pri l^Zr > ^^{^ognfj < exp [logr/(r)^('°s")' 



< exp 



2/3 
/3-1 



logn + O ((logn) ^) 



= 0(n-2). 



□ 



Lemma A.4. Using the same setup as in Lemma A. 2, let rji 



(/3-l)log/3 



, then 



Pri ( sup Zt > r]i{logn)A = O (n'^) . 

\0<t<r J 
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Proof. For the duration of the proof of Lemma A. 4 let T := mi{t \ Zt > (logra)^}. Then the 
claim in Lemma A. 4 is equivalent to Fi [T < r) = O (^^~^) • Our strategy is to use equation (14) 
in Lemma A. 2 to say that can be at most 0((logn)^), then use the Strong Markov Property 
to say that if Zt exceeds 77i(logn)^ at any time t < r then it is unlikely to drop below size 
0{{\ogn)^) by time r. 



Pri (T < r) < Pri T < r, Zr < 



2/3 



< Pri Zr < 



2(3 



< Pr^i(logn)4 



inf Zt < 



(3-1 
{lognf 

2(3 



(logn 



Zr > 



t<r 



(3-1 



+ Pri 
T < rj + 0(n-^) 
(logn)^^ +0(n-2 

2/3 



2/? 
(3-1 



(logn) 



< 1 - [1 - Pr3iog,„ (Z. = 0)]?-t(^°^")' + 0{n-^) 



< 1 



^ ^(logn)3 



+ 0(n-2) = O {n-^) . 



We applied the Strong Markov Property and translation invariance of the branching process 
at the third line above. The fourth line follows by observing that for the branching process to 



transition from size 771 (log n)^ to size j^i^ogn)^, then at least one of the w!^{^ogn)^ sets of 



/3 



2/3 



Slogan individuals must go extinct. The fifth line follows from Lemma A.l. 



□ 
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